
feat(platform): add phase8 real load execution evidence pipeline#22

Merged
ActiveInAI merged 4 commits into main from feat/platform-phase8-real-load-execution-evidence
May 1, 2026


@ActiveInAI

Owner

Summary

Adds the Phase 8.2 real load execution evidence pipeline so that ArchIToken can progress from the 100k certification gates to machine-verifiable evidence of real execution.

Scope

  • Extends the Phase 8 load evidence contract with staged smoke/1k/10k/25k/50k/100k execution results.
  • Requires immutable artifact binding:
    • git SHA
    • K8s manifest hash
    • Docker image digest
    • k6 script hash
  • Requires Prometheus, Grafana, and OpenTelemetry evidence before certification.
  • Adds Prometheus snapshot collection tooling.
  • Adds evidence merge tooling for k6 summary + Prometheus snapshot + K8s state + git metadata.
  • Adds live K8s runtime validation tooling.
  • Adds staged load execution scripts:
    • smoke
    • 1k
    • 10k
    • 25k
    • 50k
    • 100k
  • Adds final certification-from-evidence script.
  • Adds Phase 8.2 docs, bottleneck playbook, and certification report template.
  • Updates Phase 8.1 documentation to state that PR #21 (feat(platform): add phase8 real 100k load certification gates) introduced gates only and does not certify real 100k concurrency.
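
As an illustration of the immutable artifact binding listed above, here is a minimal sketch that hashes an artifact file and reports which binding fields are missing from an evidence record. The field names (`git_sha`, `k8s_manifest_sha256`, `docker_image_digest`, `k6_script_sha256`) and both helper functions are hypothetical and are not taken from the PR's evidence contract.

```python
import hashlib
from pathlib import Path

# Hypothetical field names; the PR's actual evidence contract may differ.
REQUIRED_BINDINGS = (
    "git_sha",
    "k8s_manifest_sha256",
    "docker_image_digest",
    "k6_script_sha256",
)


def sha256_file(path):
    """Return the hex SHA-256 digest of a file, read in 64 KiB chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def missing_bindings(evidence):
    """List the binding fields that are absent or blank in an evidence record."""
    return [
        key
        for key in REQUIRED_BINDINGS
        if not str(evidence.get(key, "")).strip()
    ]
```

A certification step could refuse to proceed whenever `missing_bindings(evidence)` is non-empty, so evidence can never be detached from the exact artifacts that produced it.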

Validation

  • rm -f 04-backend/openapitools.json
  • git diff --check
  • python3 -m unittest tools/test_phase8_load_evidence.py
  • python3 -m unittest tools/test_phase8_prometheus_snapshot.py
  • python3 -m unittest tools/test_phase8_merge_load_evidence.py
  • python3 -m unittest tools/test_phase8_runtime_cluster_validation.py
  • bash -n 04-backend/scripts/phase8-run-smoke.sh
  • bash -n 04-backend/scripts/phase8-run-1k.sh
  • bash -n 04-backend/scripts/phase8-run-10k.sh
  • bash -n 04-backend/scripts/phase8-run-25k.sh
  • bash -n 04-backend/scripts/phase8-run-50k.sh
  • bash -n 04-backend/scripts/phase8-run-100k.sh
  • bash -n 04-backend/scripts/phase8-certify-from-evidence.sh
  • cargo fmt --all -- --check
  • cargo clippy --all-targets --all-features -- -D warnings
  • cargo test --all-targets --all-features
  • cargo build --release --bin insomeos-gateway
  • 04-backend/scripts/guard-proprietary-runtime.sh

Boundary

No real external 100k load run was executed in this PR.

This PR does not claim ArchIToken is certified for 100k concurrent users. It adds the execution evidence pipeline required to run, merge, validate, and certify real external 100k load evidence.

No proprietary RealBIM / BlackHole / OptRapid runtime asset is imported into open-source core.

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f9426e7968

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

"stage": stage,
"start_time": str(metadata.get("start_time", "")).strip(),
"end_time": str(metadata.get("end_time", "")).strip(),
"vu": int(metadata.get("vu", metric(summary, "vus_max", "max", 0))),


P1 Badge Use observed VUs when computing achieved concurrency

The merge step currently sets each stage vu from stage metadata first, and those metadata files are generated by the run scripts using the target VU constant rather than an observed value. This allows a run that never actually reached 100k VUs to still produce achieved_concurrency=100000 and pass certification gates, because validation only checks the merged achieved_concurrency field. Use k6-observed metrics (for example vus_max) as the source of truth and treat metadata as informational.
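
One possible shape for this change, as a sketch only: take `vu` from the k6 summary and keep the metadata target as a separate, informational field. The `metric` helper below is a hypothetical stand-in that mirrors the call signature in the quoted diff, and the summary layout it reads (`summary["metrics"][name][field]`) is an assumption about the exported k6 summary, not something confirmed by the PR.

```python
def metric(summary, name, field, default=0):
    """Hypothetical stand-in for the PR's metric() helper.

    Assumes the summary is shaped like {"metrics": {name: {field: value}}}.
    """
    return summary.get("metrics", {}).get(name, {}).get(field, default)


def stage_concurrency(summary, metadata):
    """Derive stage concurrency from observed k6 data, not from the target."""
    observed = int(metric(summary, "vus_max", "max", 0))
    target = int(metadata.get("vu", 0))  # informational only
    return {
        "vu": observed,
        "target_vu": target,
        "reached_target": target > 0 and observed >= target,
    }
```

With this split, a run that stalls below its target reports the lower observed value, so a downstream `achieved_concurrency` check cannot be satisfied by the target constant alone.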


Comment on lines +189 to +190
"dropped_connections": int(metrics.get("realtime.dropped_connections", 0)),
"gateway_restarts": int(metrics.get("gateway.restarts", 0)),


P1 Badge Fail when restart/drop metrics are missing from Prometheus data

These fields default to 0 when absent, but the bundled snapshot collector (tools/phase8_collect_prometheus_snapshot.py) does not query realtime.dropped_connections or gateway.restarts, so merged evidence will report zero by default and can incorrectly satisfy threshold checks during real instability. This creates false-positive certifications unless missing critical metrics are treated as an error (or explicitly queried and required).
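
A minimal sketch of the "treat missing as an error" option, assuming `metrics` is a flat dict keyed by the metric names shown in the diff. The function name `required_counters` is hypothetical.

```python
REQUIRED_METRICS = ("realtime.dropped_connections", "gateway.restarts")


def required_counters(metrics):
    """Read critical counters, raising instead of defaulting to zero."""
    missing = [name for name in REQUIRED_METRICS if name not in metrics]
    if missing:
        raise ValueError(
            "missing required Prometheus metrics: " + ", ".join(missing)
        )
    return {
        "dropped_connections": int(metrics["realtime.dropped_connections"]),
        "gateway_restarts": int(metrics["gateway.restarts"]),
    }
```

An explicit zero from Prometheus still passes, while an absent series stops the merge, which keeps "no data" distinct from "no incidents".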


Comment on lines +127 to +130
pods = [item for item in snapshot.get("items", []) if item.get("kind") == "Pod"]
for pod in pods:
    if not pod_ready(pod):
        errors.append(f"Pod/{pod.get('metadata', {}).get('name', 'unknown')} is not ready")


P2 Badge Restrict readiness checks to relevant workload pods

The validator marks the snapshot invalid if any pod in the namespace is not Running and ready, including unrelated completed Job/CronJob pods. In a production-like namespace with normal completed maintenance/migration pods, this will block certification even when all required gateway/stateful components are healthy. The readiness loop should scope to the certification workloads (or ignore terminal Succeeded pods) to avoid false failures.
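
A sketch of the simpler of the two options, skipping pods in the terminal `Succeeded` phase before applying the readiness check. The `pod_ready` body here is a hypothetical stand-in (Running phase plus a `Ready=True` condition), and the PR's real implementation may differ.

```python
def pod_ready(pod):
    """Hypothetical stand-in: Running phase with a Ready=True condition."""
    status = pod.get("status", {})
    if status.get("phase") != "Running":
        return False
    return any(
        cond.get("type") == "Ready" and cond.get("status") == "True"
        for cond in status.get("conditions", [])
    )


def readiness_errors(snapshot):
    """Collect readiness errors, ignoring pods that completed successfully."""
    errors = []
    for item in snapshot.get("items", []):
        if item.get("kind") != "Pod":
            continue
        if item.get("status", {}).get("phase") == "Succeeded":
            continue  # completed Job/CronJob pods are terminal, not unhealthy
        if not pod_ready(item):
            name = item.get("metadata", {}).get("name", "unknown")
            errors.append(f"Pod/{name} is not ready")
    return errors
```

Scoping by workload label or owner reference would be stricter, but it requires knowing which selectors identify the certification workloads, which this sketch does not assume.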


@ActiveInAI ActiveInAI merged commit 153aec5 into main May 1, 2026
6 checks passed
@ActiveInAI ActiveInAI deleted the feat/platform-phase8-real-load-execution-evidence branch May 1, 2026 05:35